Improved low bit-rate audio compression using reduced rank ICA instead of psychoacoustic modeling

نویسندگان

  • Adiel Ben-Shalom
  • Michael Werman
  • Shlomo Dubnov
چکیده

Traditional audio coding is based on a perceptual compression paradigm that exploits psychoacoustic information to efficiently encode audio signals. Recently, extensive research has been conducted in order to understand how the brain encodes natural signals. These results suggest that the encoding process is very efficient in terms of redundancy reduction of the signal information. It could be that the psychoacoustic effects (such as the masking effect) are only a special case of a more general redundancy reduction mechanism that exists in the auditory pathway. Motivated by this work we propose a new audio coding scheme that is based on improved sound representation found by Independent Component Analysis. Using a local linear, low rank, non-orthogonal transform, we remove additional redundancies in the signal. At low bitrates this coding scheme gives results superior to a legacy perceptual encoding scheme for different kinds of audio signals.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study of Mutual Information in Perceptual Coding with Application for Low Bit-rate Compression

In this paper we analyze this aspect of redundancy reduction as it appears in MPEG1-Layer 1 codec. Specifically, we consider the mutual information that exists between filter bank coefficients and show that the normalization operation indeed reduces the amount of dependency between the various channels. Next, the effect of masking normalization is considered in terms of its compression performa...

متن کامل

An adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria

This paper outlines an adaptive wavelet-based perceptual audio coding scheme attending to various entropy-type criteria. Its performance using some different wavelet families and various filter lengths and decomposition depths has also been investigated. An optimal choice of these parameters is accomplished in order to evaluate both quality and bit rate of compressed signals for four different ...

متن کامل

Improved audio coding using a psychoacoustic model based on a cochlear filter bank

Perceptual audio coders use an estimated masked threshold for the determination of the maximum permissible just-inaudible noise level introduced by quantization. This estimate is derived from a psychoacoustic model mimicking the properties of masking. Most psychoacoustic models for coding applications use a uniform (equal bandwidth) spectral decomposition as a first step to approximate the freq...

متن کامل

PEAQ based Psychoacoustic Model Implementation using Wavelet Packet Decomposition

Audio compression is the lossy compression technique of converting audio signal into an efficiently encoded bitstream that can be decoded to produce a close approximation of the original signal. For the purpose of improving the coding this work attempts to combine psychoacoustic model for perceptual evaluation of audio quality in BS.1387 with perceptual audio coder. The implementation of this n...

متن کامل

Psychoacoustic-based quantisation of spatial audio cues

The derivation of spatial cues representing source localisation information is a typical component of multichannel spatial audio coders. Efficient compression of spatial cues based on psychoacoustic localisation features is investigated. Results show that the proposed quantisation approach for spatial cue compression achieves bit-rates of less than 6 kbit/s while preserving critical source loca...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003